Constraint‐based multi‐agent reinforcement learning for collaborative tasks
نویسندگان
چکیده
In order to be successfully executed, collaborative tasks performed by two agents often require a cooperative strategy learned. this work, we propose constraint-based multi-agent reinforcement learning approach called constrained soft actor critic (C-MSAC) train control policies for simulated performing multi-phase tasks. Given task with n $$ phases, the first − 1 n-1 phases are treated as constraints final phase objective, which is addressed centralized training and decentralized execution approach. We highlight our framework on tray balancing including phases: lifting target following. evaluate proposed compare it against its unconstrained variant (MSAC). The comparisons show that C-MSAC leads higher success rates, more robust policies, better generalization performance.
منابع مشابه
Collaborative Multiagent Reinforcement Learning by Payoff Propagation
In this article we describe a set of scalable techniques for learning the behavior of a group of agents in a collaborative multiagent setting. As a basis we use the framework of coordination graphs of Guestrin, Koller, and Parr (2002a) which exploits the dependencies between agents to decompose the global payoff function into a sum of local terms. First, we deal with the single-state case and d...
متن کاملTransfer Learning for Multiagent Reinforcement Learning Systems
Reinforcement learning methods have successfully been applied to build autonomous agents that solve many sequential decision making problems. However, agents need a long time to learn a suitable policy, specially when multiple autonomous agents are in the environment. This research aims to propose a Transfer Learning (TL) framework to accelerate learning by exploiting two knowledge sources: (i)...
متن کاملAsymmetric Multiagent Reinforcement Learning
A novel model for asymmetric multiagent reinforcement learning is introduced in this paper. The model addresses the problem where the information states of the agents involved in the learning task are not equal; some agents (leaders) have information how their opponents (followers) will select their actions and based on this information leaders encourage followers to select actions that lead to...
متن کاملA multiagent architecture for concurrent reinforcement learning
In this paper we propose a multiagent architecture for implementing concurrent reinforcement learning, an approach where several agents, sharing the same environment, perceptions and actions, work towards one only objective: learning a single value function. We present encouraging experimental results derived from the initial phase of our research on the combination of concurrent reinforcement ...
متن کاملScalable Bayesian Reinforcement Learning for Multiagent POMDPs
Bayesian methods for reinforcement learning (RL) allow model uncertainty to be considered explicitly and offer a principled way of dealing with the exploration/exploitation tradeoff. However, for multiagent systems there have been few such approaches, and none of them apply to problems with state uncertainty. In this paper, we fill this gap by proposing a Bayesian RL framework for multiagent pa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Computer Animation and Virtual Worlds
سال: 2023
ISSN: ['1546-427X', '1546-4261']
DOI: https://doi.org/10.1002/cav.2182